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ERROR DETECTION IN LOW BIT-RATE VIDEO TRANSMISSION 

FIELD OF THE INVENTION 

5 The present invention regards video transmission, and especially a method and 
device for decoding compressed video data, wherein information about the 
spatial frequency distribution of a video data block is transformed into pixel 
values. 

10 BACKGROUND OF THE INVENTION 

One of the targets In telecommunications is to provide systems where good 
quality, real-time transmission of video, audio and data services is available. As 
is generally known, the amount of data needed to transfer moving pictures is 

15 high compared to many other types of media, and so far, usage of video in low 
bit-rate terminals has been negligible. Transmission of data in digital form, 
anyhow, has provided for increased signal-to-noise ratios and increased 
information capacity in the transmission channel. In the near future advanced 
digital mobile telecommunication systems will also be introducing services 

20 enhancing the transmission bit-rates, which means that transmission of video 
even over low bit-rate mobile channels will soon become more feasible. 

For optimisation of channel capacity usage, signals are generally compressed 
before transmission. This is especially important with video transmission, where 

25 the amount of data to be transmitted is large. Compressed video, is easily 
afflicted by transmission errors, mainly because the information content of 
compressed video is generally coded using variable length codes. When a bit 
error alters the codeword to another one of different length, the decoder loses 
synchronisation and decodes consecutive error free blocks incorrectly until the 

30 next synchronisation code is received. 




To limit the degradations in images caused by transmission errors, error 
detection and/or error correction methods can be applied, retransmissions can 
be used, and/or effects from the received corrupted data can be concealed. 
Normally retransmissions provide a reasonable way to protect data streams from 
5 errors, but long round-trip delays associated with low bit-rate transmission and 
moderate or high error rates make It practically Impossible to use retransmission, 
especially with real-time videophone applications. Error detection and correction 
methods usually require a large overhead since they add some redundancy to 
the data. Consequently, for low bit-rate applications, error concealment can be 
10 considered as a good way to protect and recover images from transmission 
errors. 

To be able to conceal transmission errors, they have to be detected and 
localised. The more Is known about the type and location of the error, the better 
15 the concealment method can be focused to the problem, and accordingly the 
better image quality will be achieved. It is also important to find methods that can 
detect especially those errors that are easily detected by the human eye. 

Lately, much Interest has been attached to error-resilient digital video 
20 transmission, but the work has mainly been concentrated on digital TV 
transmission using MPEG-2. There the problem Is solved mainly by adding 
unique sync codes frequently to the bit stream, using short packets with a cyclic 
redundancy check (CRC), and discarding all packets where the CRC Indicates 
an error. When the bit-rate of transmission is a few megabytes per second, the 
25 proportion of frequently occurring sync codes or CRC fields in the whole data 
stream is usually acceptable. However, in low bit-rate transmission the situation 
Is quite different, and with bit-rates of 20-30 kbps the optimisation of overheads 
Is extremely important. Furthermore, if the size of the picture is for example 
704*576 pixels, one 16*16 pixel macroblock covers about 0,061% of the whole 
30 picture, whereas in low bit-rate QCIF (Quarter Common Intermediate Format) 
176*144 pixel pictures, one macroblock covers more than 1% of the whole 
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image. Hence, the loss of a macroblock is more detrimental in low bit-rate 
videophone pictures than in television pictures. 

The main interest in low bit-rate video coding standardisation bodies has been to 
improve error resilience of inter coded frames. Most presented methods suggest 
changing of the bit-stream syntax and coding algorithms, whereby they can be 
properly utilised only if they are widely supported by users' videophone 
terminals. Generally two methods of error detection have been put forward: 
detection of illegal variable length coding (VLC) code words, and detection of 
missing end block codes of discrete cosine transform (DCT) matrices. In practice 
these methods have been found to be Insufficient especially for intra coded 
blocks, since a great many VLC errors remain undetected, and errors in fixed 
length coded DC components of intra coded blocks are often not detected at ail. 
Furthermore, errors are usually detected far too late, after decoding several 
corrupted blocks. 

The publication of Wai-Man Lam and Amy R. Reibman, "An error Concealment 
Algorithm for Images Subject to Channel Errors", in IEEE Transactions on Image 
processing, Vol. 4, No. 5, pp.533-542, May 1995 presents some DCT and pixel 
20 domain error detection algorithms. These algorithms, however, do not apply 
adequately to low bit-rates and low resolutions, especially due to the 
inapplicability of DCT domain algorithms for the different characteristics of 
quantised DCT matrices. 

25 The publication of Aki Hietala, "Virhesietoinen videodekoodaus", Master of 
Science Thesis, Oulu University, Department of Electrical Techniques, 1997, 
presents and analyses some methods for error detection in video bitstreams. 
The methods utilise the residual correlation of adjacent pixels (spatial 
correlation) and by detecting anomalies in block boundaries, search for 

30 corrupted blocks. However, the methods are considered rather cornplex and the 
achieved effect has not yet been sufficient. 
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The publication of M.R.Pickering, M.R.Frater, J.F.Amold, and M.W.Grigg, "An 
Error Concealment Technique in the Spatial Frequency Domain", Signal 
Processing, no.54, Elsevier 1996, pp.1 85-1 89 presents a method for concealing 
errors that are caused by blocks in the image which are similar in appearance to 
5 a single DCT basis function. In the method unusually large DCT coefficients in 
the 8*8 block of coefficients are detected and reduced to zero. This method 
works well with specific types of transmission errors, but as a single means of 
detection has a limited effect. 

1 0 SUMMARY AND OBJECT OF THE INVENTION 

Now, a new method for decoding video data blocks using variable length codes 
has been invented with which the above mentioned drawbacks can be reduced. 
The method according to the invention is characterised by generating, prior to 

15 said transformation, a first reference value representing the variations in 
information about spatial frequency distribution within the block; generating, after 
said transformation, a second reference value representing the abruptness of 
variation in certain information between the block and at least one previously 
transformed video data block; comparing the first reference value to a certain 

20 first threshold value and the second reference value to a certain second 
threshold value; and detecting an error in the block, as a response to either of 
the first and second reference values being greater than the first and 
respectively the second threshold value. 

25 An object of the invention is to provide a set of improved error detection 
elements to be combined with different steps of decoding intra coded video data 
blocks. The use of at least two of the error detection elements of the invention 
utilising information in different forms and/or stages of the decoding process will 
improve the accuracy of error detection and still not unreasonably increase the 

30 complexity of the decoding process. The use of error detection according to the 
invention enables enhanced error concealment processes and therewith 
improves the error resilience of video data transmission at low bit-rates. 
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The invented methods utilise the slowly varying nature of information in natural 
pictures by assuming a relatively high correlation between adjacent blocks. 
Blocks with shapes that are very improbable in nature can be studied more 
carefully. In the methods, relatively high correlation is expected between 
neighbouring blocks and means for noticing some very abrupt variations in bit- 
streams are presented. An unexpected anomaly in the video sequence is 
interpreted as indicating a suspicious or corrupted block, or a number of blocks 
(macroblock). 



Furthermore, a device for decoding video data is presented. The device 
comprises means for transforming information about the spatial frequency 
distribution of a video data block into pixel values; and it is characterized by 
means for generating, prior to said transformation, ' a first reference value 

15 representing the variations in information about spatial frequency distribution 
within the block; means for generating, after said transformation, a second 
reference value representing the abruptness of variation in certain information 
between the block and at least one previously transformed video data block; 
means for comparing the first reference value to a certain first threshold value 

20 and the second reference value to a certain second threshold value; and means 
for detecting an error in the block, as a response to either of the first and second 
reference values being greater than the first and respectively the second 
threshold value. 

25 BRIEF DESCRIPTION OF THE DRAWINGS 

The invention will now be described, by way of example only, with reference to 
the accompanying figures, of which: 

Figure 1 illustrates the phases of encoding and decoding intra-coded video 
30 * images; 

Figure 2 illustrates the configuration of a video image according to the H.261 
•standard; 



• Figure 3 illustrates the elements of the Invented method; 
Figure 4a illustrates the configuration of a DCT matrix; 
Figures 4b-4d illustrate different ways of dividing a DCT matrix; 
The flow chart of Figure 5a illustrates the principle of the first detection element 
5 according to the invention; 

The flow chart of Figure 5b illustrates an embodiment of the method of Fig 5a; 
Figure 6 illustrates the principle of the second detection block according to the 
invention; 

Figure 7 illustrates the principle of an embodiment of the second detection block; 
10 Figure 8a illustrates the principle of the third detection block according to the 
invention; 

Figure 8b illustrates an embodiment of a third detection block according to the 
invention; 

Figure 8c illustrates another embodiment of the method according to the 
15 invention; 

Figure 9 illustrates a functional architecture of an embodiment of the invention; 
Figure 10 illustrates an embodiment of a video image decoder according to the 
invention; and 

Figure 11 illustrates an embodiment of a mobile terminal according to the 
20 invention. 

DETAILED DESCRIPTION OF THE INVENTION 

A digital image is formed by sampling and quantising analogue picture 
25 information and transforming the generated data into a continuous stream of 
bits. The digitised signal allows the use of advanced digital signal processing 
tools, which permit faster and more efficient data transfer. Several image-coding 
algorithms have recently been developed to reduce the number of bits 
necessary for digital image representation and correspondingly reduce the bit- 
30 rates required for transmission of digital images. JPEG (Joint Photographic 
Experts Group) is a widely used algorithm for still images, CCITT (ITU 
Telecommunication Standardisation Sector) recommendation H.261 has been 
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developed for videoconferencing, H.263 for videotelephony and MPEG (Moving 
Pictures Expert Group) for transferring and storing moving video pictures. The 
block diagram of Figure 1 illustrates the basic stages of video encoding and 
decoding used in these standards and generally known to a person skilled in the 
5 art. The digital Image data is divided 1 1 into small blocks comprising a certain 
number of pixels (e.g. one block contains 8x8 pixels). The data in each block is 
transformed into the spatial-frequency domain using the Discrete Cosine 
Transform (DCT) 12. The derived DCT matrix is quantized 13 and the quantized 
signal is coded using a table of Variable Length Codewords (VLC) 14. The 
10 coded signal is transmitted to the receiver. At the receiving end the inverse 
processes 15, 16 and 17 are implemented in a reverse order to reconstruct the 
image. 

The resolution of a digital image is defined by the number of pixels in the picture 
15 matrix. Sampling with 8 bits for each of one luminance (Y) and two chrominance 
components (U, V) results in 2^^-16 million available colours. The human visual 
system is more sensitive to luminance than chrominance components, so 
generally the chrominance components of the picture are spatially 
undersampled. For example in ITU-T H.261 recommendation, for every four 
20 luminance blocks two chrominance blocks are used. As illustrated in Figure 2, 
sets of 4 luminance and 2 chrominance blocks form a macroblock 21, and an 
H.261 image 23 comprises 12 block groups 22, that are fonned by 3x11 
macroblocks. Corresponding structural grouping systems are used in other 
coding standards. 

25 

The flow chart of Figure 3 illustrates the elements of the invented method in 
connection with the steps of decoding a macroblock. The invention is based on 
the idea that blocks are an artificial way to divide information, and therefore in 
low bit-rate video sequences variations of natural images between blocks should 
30 occur slowly and/or within certain limits in an expected manner. JThe method 
comprises three separate detection elements, of which at least two are 
combined with the decoding process of variable length decoding 31, inverse 



quantization 32 and inverse DCT 33. The detection elements utilise available 
information at the different levels of decoding to detect transmission errors. The 
first detection element 34 performs steps for inspecting block level DCT 
components, and it can be performed either before .of after inverse quantization. 
5 For the purposes of the second and third detection elements, the DCT 
components of the current macroblock are temporarily stored e.g. to a volatile 
memory of the decoder. The second detection element 35 performs steps for 
block level spatial comparison, and the third detection element 36 performs 
comparisons at the macroblock level. For detection, only corresponding 
10 components are compared with each other (i.e. Y- U-, and V-components 
separately). The interpretation of detection can rely on results from studying only 
one component, or results from studying more components as well. In the 
following, the detection elements of Figure 3 will be studied in more detail. 

15 1. First detection block (34) 

After a discrete cosine transform, a pixel block can be presented as a DCT 
matrix comprising a DC coefficient and a plurality of AC coefficients, zigzag- 
scanned from lower to higher frequency coefficients as shown in figure 4a. in 
practice, it is highly improbable that there would be large amplitudes in high 

20 frequency AC components in low-resolution pictures. However, large amplitudes 
are possible, and In the invented method they are not simply filtered out, but are 
used to detect errors by appreciating the fact that high frequency AC 
components should have smaller absolute values than the lower frequency AC 
coefficients. 

25 

The flow chart of Figure 5a illustrates the principle of the first detection element 
by a simplified method to check the validity of a DCT matrix. In step 510 the AC 
components of the DCT matrix are divided into at least two groups, where 
certain higher frequency components AC36-AC63 (ref: Figure 4a) form a first 
30 group and a second group is a selected set of the remaining AC_components 
(later referred to as low frequency components). At least one first threshold 
value TH1, representing the activity in the lower frequencies is calculated from 



the AC components of the second group (step 520). Furthermore, at least one 
reference value Xref is calculated in step 530. The reference value represents 
the magnitude of non-zero coefficients, for example of the AC components of the 
first group or of the AC components of the second- group. The reference value 
5 Xref is compared (step 540) to the derived first threshold value TH1 , and if the 
reference value is greater than the threshold value (step 560), it means that an 
error is detected. 

The flow chart of Figure 5b illustrates an embodiment of the method In Figure 

10 5a, in which actually two first reference values and corresponding first threshold 
values are generated. In step 511 the DCT matrix is divided into horizontal, 
vertical, diagonal and high frequency bands. Exemplary horizontal, vertical and 
diagonal bands, further referred to collectively as low frequency bands, are 
illustrated in figures 4b, 4c, and 4d respectively. There can be some overlap 

15 between these low frequency bands. In step 512 the first low frequency band k Is 
chosen. The absolute sum abSum^ of the coefficients, the greatest absolute 
coefficient value AC^^^^ and the number of non-zero factors of abSum^ are 
calculated in step 513. In step 514 the number of non-zero coefficients in the low 
frequency band k Is checked, and if there is more than one non-zero coefficients 

20 in the band, the coefficient with the greatest absolute value AC„,^^ is subtracted 
from the absolute sum of the other non-zero coefficients abSum^, and the sum is 
added to a predefined constant value C,. The attained sum is defined 521 as an 
auxiliary first threshold THIa. If there Is only one or no non-zero coefficients, the 
auxiliary first threshold value THIa is defined 522 to be the predefined constant 

25 value C, . In step 541 the greatest absolute coefficient value AC^^,^^ (first 
reference value) is compared to the first threshold value THIa, and if the first 
reference value AC^„^ is greater or equal than the first threshold value THIa, an 
error is detected 560. If the first reference value AC^^,, is smaller than the first 
threshold value THIa, it is checked 542, whether all low frequency bands have 

30 already been examined. If not, the next one is chosen (step 543). 
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When all the low frequency bands have been examined, the high frequency 
band is also studied. In step 544 a second first threshold is derived from the 
absolute values of coefficients in the low frequency bands by choosing 
TH1b=max(C„ AC^^^^; k=1...K). After this, the first coefficient j of the high 
5 frequency band is examined (step 545). The first reference value now Xref is the 
absolute value of the chosen high frequency coefficient, and if the Xref is greater 
than the threshold TH1b (step 546), an error is detected (step 560). The loop 
(steps 546-548) is repeated until all the components in the high frequency band 
have been studied. If neither of the thresholds TH1a and TH1b are exceeded in 
10 the process, the method indicates (step 550) that no errors have been detected 
in this block at this stage. 

2. Second detection element (35) 

15 As already mentioned, variations between neighbouring blocks in natural 
pictures tend to progress relatively smoothly. Consequently, the operation of the 
second detection element is based on monitoring correlation between 
neighbouring blocks. Preferentially, the second detection element is included in 
the decoding process after the inverse discrete cosine transform. For the 

20 purposes of the second and third detection elements, the DCT components of 
the current macroblock are temporarily stored e.g. to a volatile memory of the 
decoder. The flow chart of Figure 6 illustrates the principle of the second 
detection element according to the invention. 

25 In step 610 a reference value Xcurr Is derived from the information of the current 
block. The reference value Xcurr represents a feature that presumably continues 
over the block boundaries, and can be derived in several ways, as will be shown 
later. In step 620 a corresponding reference value Xneigh is derived from the 
information of at least one neighbouring block. In step 630 the reference values 

30 are compared with each other in order to derive a difference valuejrepresenting 
the variation A of the studied feature when moving from block to block. If the 
variation is larger than a second threshold TH2 (step 640), an error is detected 



(step 660). If the variation does not exceed the second threshold TH2, no error is 
detected (step 650). The second threshold TH2 can be e.g. a predefined 
constant. 

5 In an embodiment of the Invented method, the reference value Xcurr is the DC 
component of the block. Only previously decoded blocks are available for 
comparison. If the error checking method is used during decoding, the DC 
components in blocks to the left, above, above-left and above-right are available 
for comparison. If the check Is done only after the whole frame is decoded, 

10 possible neighbours for some blocks can also be found in the row below the 
current one. If the difference between the current block and every available 
neighbouring block is larger than a certain threshold, an error in the current block 
is detected. In practice, the threshold should be rather high, since in low 
resolution Images the contents of two adjacent blocks can be quite different. 

15 Anyhow, this check requires only a few comparisons and storing of DC 
components, and therefore does not add much complexity to the decoding 
process. 

In another embodiment of the Invented method, the studied block is divided into 
20 a number of sub-blocks (for example an 8*8 block is divided into four 4*4 sub- 
blocks). For each sub-block the average of the pixel values is calculated and the 
calculated value is used as the reference value Xcurr for that sub-block. As a 
reference value of the neighbouring block Xneigh, the averaged pixel value of 
the neighbouring sub-block in left, above, above-left and above-right directions in 
25 turns are used. The variation A equals the difference between the reference 
value Xcurr and the averaged pixel value Xneigh of each of the studied 
neighbouring sub-blocks. If the difference A for a sub-block and any of its studied 
neighbours is greater than a predefined second threshold TH2, an error is 
detected. Anyhow, if such interpretation in this case seems too strong, the block 
30 .can be marked suspicious, and the check can be supplemented witb some other 
check. 
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In another embodiment of the invented method, pixels in the block boundary are 
used to check the continuity of the image. In prior art solutions, only pixels 
immediately in the block boundary are studied, but in practice, this has not 
proved to be enough. In an enhanced method, the .gradient of changes in pixel 
5 values close to the boundary Is also taken into account. The principle of such an 
embodiment of the second detection block is illustrated in Figure 7. 

In Figure 7a a boundary 70 between two adjacent blocks 71 and 72 is shown. 
Point 73 represents the value of a chosen component (e.g. luminance) of the 

10 pixel of the first block 71 closest to the boundary 70. Point 74 represents the 
value of the same component for a pixel of the second block 72 closest to the 
boundary and In the same row as the pixel of point 73. Point 75 represents the 
value the same component for a pixel of the first block 71 situated next to the 
boundary pixel 73 and farther away from the boundary 70. Point 76 represents 

15 the value of the same component for a pixel of the second block 72 situated next 
to the boundary pixel 74 and farther away from the boundary 70. First, the 
difference d1 between values of boundary pixels 73 and 74 is derived. Then, 
values 77 and 78 are extrapolated from the values of points 73/75 and 74/76 
respectively. The difference d2 between the extrapolated values is calculated, 

20 and the differences d1 and d2 are compared with each other. The smaller of 
them min(d1 , d2) is added to a cumulative sum A calculated for the boundary 70 
of the block 71 . The total sum A is compared with a predefined second threshold 
TH2, and rf the sum A is greater than TH2, the other boundaries are checked 
similarly. If the sums of all boundaries exceed TH2, an error is detected. In this 

25 example the luminance component is used for calculations, but generally any of 
the luminance and chrominance components (Y, U, V) can be used, and/or the 
check can be implemented for each of the components separately. The criterion 
can also be modified to indicate an error if the sum of one/two/three boundaries 
show values exceeding TH2. In Figure 7b the same arrangement as in Figure 

30 7a, but with different pixel values and a different direction of change is 
presented. With extrapolation, unnecessarily hasty conclusions about the 
validity/non-validity of the block can be avoided. 
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In the prior art literature some edge detectors for block boundaries have been 
presented. The embodiments shown here can be supplemented with the use of 
such edge detectors, e.g. compass gradient operators. 

5 

3. Third detection element (36) 

At the macroblock level information about a plurality of blocks can be studied 
and deviations between blocks can be examined in more detail. For the 

10 purposes of macroblock check, all or a chosen set of DCT components of the 
macroblock will be stored in a volatile memory of the decoder. The flow chart of 
Figure 8a illustrates the principle of detection methods at the macroblock level. 
In steps 810 and 820 the first block B. is received and a certain macroblock level 
parameter q. representing the feature whose variations are examined throughout 

15 the macroblock is stored in a memory 830. Information is gathered with the 
progress of loop of 810-850 until the counter j reaches the value J (step 830), 
which equals the number of blocks in the macroblock. When the whole 
macroblock has been received and the parameters q for all blocks are stored, a 
reference value or a set of reference values Qcurr is derived 860 from the 

20 parameters q^. Qcurr is checked 870 against a third threshold TH3 representing 
a limit set for Qcurr to fulfil certain predefined criteria. In case the reference 
value Qcurr is less than the third threshold TH3, no error is detected (step 890). 
If the reference value exceeds TH3, an error is detected (step 895). 

25 In an embodiment of the method as shown in Figure 8a, the reference values 
and the checking criteria are based on local spectral correlation. In practice, 
most of the visually noticeable shape information of a frame can be found in the 
luminance component. Consequently, if there are small changes in luminance 
blocks, not many changes should occur in chrominance blocks of the 

30 macroblock either. This is especially true if the image is sampled e.g. using 4:2:0 
format (i.e. four Y-blocks with one U-block and one V-block). The flow chart of 
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Figure 8b illustrates such an embodiment of a method according to the 
invention. 

Steps 810-850 follow the process illustrated with Figure 8a, except that in step 
5 831 the parameter represents the variations of the values of the AC 
coefficients in U, V, and Y blocks (AC^.^, ACy^, ACy J of the macroblock. In step 
861 values AC^^^ and AC^^ representing the amount of variation of AC 
components in U- and V-blocks Is derived, and they are processed into a value 
TH3 representing the third threshold. In step 862 a corresponding reference 

10 value ACyj representing the variation of AC components in the luminance (Y) 
blocks is derived and processed into a third reference value Qcurr. In block 871 
the reference value Qcurr and the threshold value TH3 ^re compared with each 
other and if the threshold based on variations in chrominance components (U 
and V) is much bigger than the reference number based on variations In 

15 luminance (Y) components (step 880), the macroblock is considered corrupted 
(step 895). Otherwise no errors are detected (step 890) at the macroblock level. 
Another possibility Is e.g. to study the variation in the DC-components of U- and 
V-blocks against the variations of the Y-components in comparison to some 
earlier decoded macroblock. If e.g. variation in U- and V-block DC-components 

20 exceed one auxiliary third threshold and variations in Y-block DC-components 
do not exceed another auxiliary third threshold, an error is detected. 

The flow chart of Figure 8c Illustrates another embodiment of the method 
according to the invention. The parameters to store in step 832 of the receiving 

25 loop are the DC components of the macroblock and an absolute sum of AC 
components of the macroblock. In step 863 the magnitude of variations In the 
DC-components throughout the macroblock is calculated and the absolute sum 
of AC components needed to account for the variations of the DC component is 
estimated. The estimated sum is used as a third threshold value TH3, and the 

30 actual variation in DC components is used as a third reference value. If the DC- 
components are varying noticeably and the AC coefficients are not enough to 
smooth the changes, the compatibility of the coefficients is questionable (step 



872). By comparing the reference value Qcurr to the threshold value TH3 (step 
880), the macroblock can be interpreted to be corrupted (step 895) or not (step 
890). 

5 The method herein has been presented at the macroblock level, but 
macroblocks can also be checked in rows. In the very first row of a video frame, 
there are not many neighbouring blocks or macroblocks available for 
comparison. If there are no abrupt changes in the first row and the values fall 
within a typical range, they can be considered uncorrupted. If there is any doubt, 
10 the values of the first row should be checked together with the second row. If the 
values of the first row are very different from the values in the second row, and 
the values of the second row do not contain any abrupt changes, the first row is 
probably corrupted. 

15 The flow chart of Figure 9 illustrates the functional architecture of an 
embodiment of the invention, where all three presented elements are included in 
the decoding process. First the method(s) 90 for checking the DCT components 
within blocks are implemented and the corrupted blocks are filtered out 
therewith. Even for the blocks that pass the DCT check a block check 91 is 

20 performed. Corrupted blocks are again filtered out, and suspicious blocks (i.e. 
blocks that have failed detection methods that do not detect errors but mark 
suspicious blocks) are forwarded for macroblock checking 92. Blocks that pass 
DCT check and block check and/or macroblock check are forwarded normally 
93, and the blocks that fail any of the checks are forwarded with an error 

25 indication for initiation of error concealment methods. 

The block diagram of Figure 10 illustrates an embodiment of a video image 
decoder 100 according to the invention. The decoder co:nprises an input port 
101 for receiving video image information in the form of variable length codes, 
30 and an output port 102 for outputting processed video image information. The 
decoder further comprises at least one processor 103 for inplementing the steps 
of decoding presented in Figure 1. A processor of a decoder according to the 
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invention is further arranged to Include at least two of the three presented 
detection blocks in the decoding process, and whenever justified, to add an 
indication of a detected error to the output video image information. The 
processor is also arranged to initiate a predefined error concealment process as 
5 a response to a detected error in a decoded block or a macroblock. The memory 
104 comprises at least a volatile memory for saving data during the decoding 
process. 

The functional block diagram of Figure 1 1 illustrates a generic mobile multimedia 

10 videophone terminal according to the invention. The terminal comprises a radio 
frequency unit 110 generally comprising means for transmission (e.g. channel 
coding, interleaving, ciphering, modulation and radio transmission) and for 
receiving (radio receiving, demodulation, deciphering, and channel decoding), a 
duplex filter and an antenna. The received synchronous bit stream is sent to 

15 the multiplex/demultiplex protocol unit 1 1 1 of the terminal. The Multiplex protocol 
multiplexes transmitted video, audio, data and control streams into a single bit 
stream, and demultiplexes a received bit stream into various multimedia 
streams. In addition, it performs logical framing, sequence numbering, error 
detection, and error correction, as appropriate to each media type. The control 

20 protocol 112 of the system control 113 provides end-to-end signaling for 
operation of the multimedia terminal, and signals all other end-to-end system 
functions. It provides for capability exchange, signaling of commands and 
indications, and messages to open and fully describe the content of logical 
channels. The data protocols 114 support data applications 115 such as 

25 electronic whiteboards, still image transfer, file exchange, database access, 
audiographics conferencing, remote device control, network protocols etc. The 
audio codec 116 encodes the audio signal from the audio I/O equipment 1 17 for 
transmission, and decodes the encoded audio stream. The decoded audio signal 
is played using audio I/O equipment 117. The video codec 118 comprises a 

30 video encoder 119 and a video decoder 100, and carries out_ redundancy 
reduction coding and decoding for video streams to and from the video I/O 



17 



equipment 120. The terminal accoring to the invention comprises a video 
decoder 100 as described earlier in connection with Figure 10. 

The above is a description of the realization of the invention and its 
embodiments utilizing examples. It is self-evident to a person skilled in the art 
5 that the invention is not limited to the details of the above presented 
embodiments and that the invention can also be realized in other embodiments 
without deviating from the characteristics of the invention. Especially the criteria 
for the decision of detected error and the cholse of threshold can be adjusted in 
many ways according to the application. The presented embodiments should 
10 therefore be regarded as illustrating but not limiting. Thus the possibilities to 
realize and use the invention are limited only by the enclosed claims. 



